
1 TTTCTTCTGT TTGCTTACTC CCTATCCGGG GGCCCAAGGC GCTGTCTCCG 

51 CCGCCCAAGC CCCGCGTAAA CCTGGGTGAC CTCGGAGACA TCCGTTGGAG 

101 CATGAGTTCC CGACATCAGG CGGCGGCGGT GGTCCGGGAG AAACCCGGCG 

151 GCGGGGAGAT AAGCCTGCCC AGGAGGCAGG GGGCTGGGCT AGCTGCCCCG 

201 CCCCGCGCCT GACTTCGTTG GGGAGGGAGA CGCCCGGCTC CCGCCCCTAA 

251 CTAGCCCAGC CGCGCGGAGC GCCTGGGAGA GGAGAAGGAG CCGACCTGCC 

301 GAGATGGAGG CGACCGGCAC CTGGGCGCTG CTGCTGGCGC TGGCGCTGCT 

351 CCTGCTGCTG ACGCTGGCGC TGTCCGGGAC CAGGGCCCGA GGCCACCTGC 

401 CCCCCGGGCC CACGCCGCTA CCACTGCTGG GAAACCTCCT GCAGCTACGG 

451 CCCGGGGCGC TGTATTCAGG GCTCATGCGG CTGAGTAAGA AGTACGGACC 

501 GGTGTTCACC ATCTACCTGG GACCCTGGCG GCCTGTGGTG GTCCTGGTTG 

551 GGCAGGAGGC TGTGCGGGAG GCCCTGGGAG GTCAGGCTGA GGAGTTCAGC 

601 GGCCGGGGAA CCGTAGCGAT GCTGGAAGGG ACTTTTGATG GCCATGGGGT 

651 TTTCTTCTCC AACGGGGAGC GGTGGAGGCA GCTGAGGAAG TTTACCATGC 

701 TTGCTCTGCG GGACCTGGGC ATGGGGAAGC GAGAAGGCGA GGAGCTGATC 

751 CAGGCGGAGG CCCGGTGTCT GGTGGAGACA TTCCAGGGGA CAGAAGGACG 

801 CCCATTCGAT CCCTCCCTGC TGCTGGCCCA GGCCACCTCC AACGTAGTCT 

851 GCTCCCTCCT CTTTGGCCTC CGCTTCTCCT ATGAGGATAA GGAGTTCCAG 

901 GCCGTGGTCC GGGCAGCTGG TGGTACCCTG CTGGGAGTCA GCTCCCAGGG 

951 GGGTCAGACC TACGAGATGT TCTCCTGGTT CCTGCGGCCC CTGCCAGGCC 

1001 CCCACAAGCA GCTCCTCCAC CACGTCAGCA CCTTGGCTGC CTTCACAGTC 

1051 CGGCAGGTGC AGCAGCACCA GGGGAACCTG GATGCTTCGG GCCCCGCACG 

1101 TGACCTTGTC GATGCCTTCC TGCTGAAGAT GGCACAGGAG GAACAAAACC 

1151 CAGGCACAGA ATTCACCAAC AAGAACATGC TGATGACAGT CATTTATTTG 

1201 CTGTTTGCTG GGACGATGAC GGTCAGCACC AC GGTCGGCT ATACCCTCCT 

1251 GCTCCTGATG AAATACCCTC ATGTCCAAAA GTGGGTACGT GAGGAGCTGA 

1301 ATCGGGAGCT GGGGGCTGGC CAGGCACCAA GCCTAGGGGA CCGTACCCGC 

1351 CTCCCTTACA CCGACGCGGT TCTGCATGAG GCGCAGCGGC TGCTGGCGCT 

1401 GGTGCCCATG GGAATACCCC GCACCCTCAT GCGGACCACC CGCTTCCGAG 

1451 GGTACACCCT GCCCCAGGGC ACGGAGGTCT TCCCCCTCCT TGGCTCCATC 

1501 CTGCATGACC CCAACATCTT CAAGCACCCA GAAGAGTTCA ACCCAGACCG 

1551 TTTCCTGGAT GCAGATGGAC GGTTCAGGAA GCATGAGGCG TTCCTGCCCT 

1601 TCTCCTTAGG GAAGCGTGTC TGCCTTGGAG AGGGCCTGGC AAAAGCGGAG 

1651 CTCTTCCTCT TCTTCACCAC CATCCTACAA GCCTTCTCCC TGGAGAGCCC 

1701 GTGCCCGCCG GACACCCTGA GCCTCAAGCC CACCGTCAGT GGCCTTTTCA 

1751 ACATTCCCCC AGCCTTCCAG CTGCAAGTCC GTCCCACTGA CCTTCACTCC 

1801 ACCACGCAGA CCAGATGAAG GAAGGCAACT TGGAAGTGGT GGGTGCCCAG 

1851 GACGGTGCCT CCAGCCTCAA CAGTGGGCAT GGACAGGGTT AATGTCTCCA 

1901 GAGTGTACAC TGCAGGCAGC CACATTTACA CGCCTGCAGT TGTTTTCCGG 

1951 AGTCTGTCCC ACGGCCCACA CGCTCACTTG ACTCATGCTG CTAAGATGCA 

2001 CAACCGCACA CCCATACACA ACTACAAGGG CCACAAAGCA ACTGCTGGGT 

2051 TAGCTTTCCA CAGACATAAA TATAGTCCAT CTGCAATCAC AAGCACATAG 

2101 CCAGGTAACC CACCAACTCC CCTGGATCTG CAGCCCACAC GTGGGAGTCT 

2151 GGCTGTCACC TTCACAAGCC ACAGAAACGG CCACACATGT TCACAGCTCA 

2201 CACGCCCTCT CCATTCATCG AACTTCTCAG TGTCCCTGTC CCTGGTGCCT 

2251 GGCACAGGGA ACAGCATGCC CCCTCCGGGG TCATGCCACC CAGAGACTGT 

2301 CGCTGTCTAT GGCCCCAACT CATGCTCCCT CTCTTGGCTA CACCACTCTC 

2351 CCAGCCTGTG ACCACCGATG TCCACAGACC CCCAACCACT TGTCCACACA 

2401 GCTACCCACG TACGACATCG TCCTGGCTCC CCAGAGTATC TTCCCACTGA 

2451 GACACGCCGC CCCCACAGAG GCACAGTCCC CAGCCACCTC TGCAACTGCA 

2501 GCCCTCAGTC ACCCCTTTTT AAGCACCCTG ATTCTACCAA ATGCAAACAC 

2551 ATCTGGGTCT GCGATTATGC ACAGAGACTT TGGACATACG AGGACCCTCA 

2601 GACCGGAGGA ACACCTGCCC AACCCCAACA CGTGCTTATG TAACCACGTG 

2651 GAAAGCGGCC CCTGCTGCCC CTCCACACAC ACATACACAC TCACTGATCT 

2701 ACAGCCCCTG TTCGGCGTCA GAGTCCCCAC TAGACCCAGT GGAAGGGGTT 

2751 AGAGACCAAG TAGGGGCCAG TTTCCAATTC ACCCTGTCAG GGAGTGAGCC 

2801 GGATCTGACG TTCCTTGTGA CTTAAGGGTC CGGCTTGGGA ATTAAAGTTT 

2851 GTTTCTGGCC TTTAGCCTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

2901 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 



FEATURES: 

5'UTR: 1-303 
Start Codon:304 
Stop Codon: 1815 
3'UTR: 1818 
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HOMOLOGOUS PROTEINS: 

gi | 117254 lsp]P24461!CPGl_RABIT CYTOCHROME P450 2G1 (CYPIIG1) (P.. 
gi |404777igb[AAA31432.1| (L10912) cytochrome P-450 2B-Bx [Oryct.. 
gi I 479930 Jpir | | S35666 cytochrome P450 2B4 isoform Bx - rabbit >.. 
gi|117212|sp|P00178|CPB4_RABIT CYTOCHROME P450 2B4 (CYPIIB4) (P.. 
gi[320075|pir| 1S31277 cytochrome P450 2B4-B1 - rabbit >gi|21369.. 
gi|4731350|gblAAD28466.1| (AF128849) cytochrome P450 2B10 relat.. 
gi J 3123191 1 sp 1 P04167 j CPB2_RAT CYTOCHROME P450 2B2 (CYPIIB2) (P4.. 
gi| 117213 |sp|Pl2789|CPB5_RABIT CYTOCHROME P450 2B5 (CYPIIB5) (P.. 
gi[89973ipir| |A27717 cytochrome P450 2B5, hepatic (formHPl) - . . 
gi|2144292|pir| j 04RTP2 cytochrome P450 2B2 - rat 



gb|BE148597|BE148597 MR0-HT0241-150500-01Q-b02 HT0241 Homo sapi... 
gb|BF359243 IBF359243 RC6-ET0072-150600-011-F01 ET0072 Homo sapi... 
gb|AW753778|AW753778 RCl-CT0286-301099-011~f 08 CT0286 Homo sapi... 

EXPRESSION INFORMATION FOR MODULATORY USE: 

gblBE148597| head_neck 
gb|BF359243t lung_tumor 
gb(AW753778{ colon 

Tissue expression : 
Human leukocyte 



516 e-145 

513 e-144 

513 e-144 

508 e-143 

508 e-143 

508 e-143 

508 e-143 

507 e-142 

507 e-142 

506 e-142 



1091 0.0 
779 0.0 
652 0.0 
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1 MEATGTWALL LALALLLLLT LALSGTRARG HLPPGPTPLP LLGNLLQLRP 

51 GALYSGLMRL SKKYGPVFTI YLGPWRPVVV LVGQEAVREA LGGQAEEFSG 

101 RGTVAMLEGT FDGHGVFFSN GERWRQLRKF TMLALRDLGM GKREGEELIQ 

151 AEARCLVETF QGTEGRPFDP SLLLAQATSN VVCSLLFGLR FSYEDKEFQA 

201 VVRAAGGTLL GVSSQGGQTY EMFSWFLRPL PGPHKQLLHH VSTLAAFTVR 

251 QVQQHQGNLD ASGPARDLVD AFLLKMAQEE QNPGTEFTNK NMLMTVIYLL 

301 FAGTMTVSTT VGYTLLLLMK YPHVQKWVRE ELNRELGAGQ APSLGDRTRL 

351 PYTDAVLHEA QRLLALVPMG IPRTLMRTTR FRGYTLPQGT EVFPLLGSIL 

401 HDPNIFKHPE EFNPDRFLDA DGRFRKHEAF LPFSLGKRVC LGEGLAKAEL 

451 FLFFTTILQA FSLESPCPPD TLSLKPTVSG LFNIPPAFQL QVRPTDLHST 

501 TQTR 



FEATURES: 

Functional domains and key regions : 

Prosite search results: 

[1] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 



2 99-101 SGR 

3 248-250 TVR 

4 288-290 TNK 

5 378-380 TTR 

6 473-475 SLK 



[3] PDOC00006 PS00006 CK2_PHOSPHOjSITE 
Casein kinase II phosphorylation site 

Number of matches: 3 

1 119-122 SNGE 

2 192-195 SYED 

3 343-346 SLGD 



[4] PDOC00008 PS00QO8 MYRISTYL 
N-myristoylation site 

Number of matches: 10 

1 51-56 GALYSG 

2 109-114 GTFDGH 

3 115-120 GVFFSN 

4 188-193 GLRFSY 

5 207-212 GTLLGV 

6 257-262 GNLDAS 

7 284-289 GTEFTN 

8 339-344 GQAPSL 

9 370-375 GIPRTL 
10 444-449 GLAKAE 



[5] PDOC00009 PS00009 AMI DAT I ON 
Amidation site 

Number of matches: 2 

1 140-143 MGKR 

2 435-438 LGKR 



[6] PDOC00029 PS00029 LEUCINE_ZIPPER 
Leucine zipper pattern 

Number of matches: 2 

1 32-53 LPPGPTPLPLLGNLLQLRPGAL 

2 39-60 LPLLGNLLQLRPGALYSGLMRL 



128-131 RKFT 



[2] PDOC00005 PS00005 PKCJ?HOSPHO_SITE 
Protein kinase C phosphorylation site 



Number of matches: 6 

1 61-63 SKK 
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[7] PDOC00081 PS00086 CYTOCHROME_P450 

Cytochrome P450 cysteine heme-iron ligand signature 

433-442 FSLGKRVCLG 

Membrane spanning structure and domains: 



Helix 


Begin 


End 


Score Certainty 


1 


5 


25 


2.305 Certain 


2 


64 


84 


0.783 Putative 


3 


170 


190 


1.041 Certain 


4 


292 


312 


2.031 Certain 


5 


448 


468 


1.133 Certain 
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BLAST Alignment to Top Hit: 

>gi 1 117254 |sp|P24461|CPGl_RABIT CYTOCHROME P450 2G1 (CYPIIGl) 
( P450-NMB ) ( OLFACT I VE ) 
pir| (B31944 cytochrome P450 2G1 - rabbit 
Length =494 



Score = 516 bits (1315), Expect - e-145 

Identities = 248/491 (50%), Positives - 345/491 (69%), Gaps = 3/491 (0%) 



Que ry : 


1 


MEATGTWALLLALAL-LLLLTLALSGTRARGHLPFGPTPLPLLGNLLQLRPGALYSGLMR 


59 






ME G + + LAL LL4- +A + G LPPGPTP+P LGNLLQ+R A + ++ 




Sb j ct : 


1 


nbLLKjAr lie LALl^i: bULLXL±AWi\RvyKrbKLrrlsfc'l if Let LtjN LLQVRI DAI r Qbi! Li\ 


bu 


Query: 


60 


LSKKYGPVFTIYLGPWRPWVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFS 


119 






L +KYGPVFT+Y+GP RPW+L G EAV+EAL +A+EFSGRG +A +E F GHGV + 




Sb j ct : 


a i 
bl 


hRhtiSilj&v k I v imt?f — KPVVlljCljHbAvKhALVDKAD.br bbKvjbLAbVbKJNr QtsribVALA 


1 1 Q 


Query: 


120 


NGERfATRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATS 


179 






NGERWR LR+F++ LRD GMGKR EE IQ EA L+E F+ T+G P DP+- L++ S 




Sbj ct : 


i on 


JN^LKWKlLKKc oLi ILKDr bmo&KblbbKiybbAbXLLbiiil! RKlKCjAr lDr lh c LbKl Vb 


1 TO 

i ( y 


Query: 


180 


NVVCSLLFGLRFSYEDKEFQAWRAAGGTLLGVSSQGGQTYEMFSWFLRPLPGPHKQLLH 


239 






NV+ S++FG RF YEDK+F +++R 4- + +S+ Q Y+M+S ++ LPG H ++ + 




Sb j ct : 


lou 


In VlbbV Vr boKc DibDJS.yr LbLLKMllNllibr ibMbl rWAtJLlUIXtXbbvMyiLrGKnlNKl X X 




Query; 


240 


HVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNPGTEFTNKNMLMTVIYL 


299 






+ L F +V+ ++ -KLD P RD +D FL+KM Q+-+ WP TEF KN+++T + L 




Sbj ct: 


240 


LIEELKDFIAARVKVNEASLDPQNP-RDFIDCFLIKMHQDKNNPHTEFNLKWLVLTTLNL 


298 


Query: 


300 


LFAGTMTVSTTVGYTLLLLMKYPHVQKWVREELNRELGAGQAPSLGDRTRLPYTDAVLHE 


359 






FAGT TVS+T+ Y LL+MK+P VQ + EE+N+ +G + PS+ DR ++P+TDAV+HE 




Sbjct: 


299 


FFAGTETVSSTLRYGFLLIMKHPEVQTKIYEEINQVIGPHRIPSVDDRVKMPFTDAVIHE 


358 


Query: 


360 


AQRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILHDPNIFKHPEEFNPDRFLD 


419 






QRL +VPMG+P ++R T FRGY LP+GT+VFPLLGS+L DP F HP++F P FLD 




Sbjct: 


359 


IQRLTDIVPMGVPHNVIRDTHFRGYLLPKGTDVFPLLGSVLKDPKYFCHPDDFYPQHFLD 


418 


Query: 


420 


ADGRFRKHEAFLPFSLGKRVCLGEGLAKAELFLFFTTILQAFSLESPCPPDTLSLKPTVS 


479 






GRF+K+EAF+PFS GKR+CLGE +A+ ELFL+FT+ILQ FSL PP + + P +S 




Sbj ct : 


419 


EQGRFKKNEAFVPFSSGKRICLGEAMARMELFLYFTSILQNFSLHPLVPPVNIDITPKIS 


478 


Query: 


480 


GLFNIPPAFQL 4 90 








G NIPP ++L 




Sbj ct : 


479 


GFGNIPPTYEL 489 





Hmmer search results (Pfam) : 

Scores for sequence family classification (score includes all domains) : 
Model Description Score E-value N 



PF00067 Cytochrome P450 594.4 6.9e-175 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



PF00067 1/1 33 493 1 497 [] 594.4 6.9e-175 
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1 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

1901 NNNNNNNNNN NNNNNNNNNN NNNNNTGACA GGGGCCATGA TGGAGACACC 

1951 TTGGATCGAA GAGGTCACAG CACCCTCCTC TTTCTTCCTC CCTACCCCCA 

2001 GCTGAGTAAG AAGTACGGAC CGGTGTTCAC CATCTACCTG GGACCCTGGC 

2051 GGCCTGTGGT GGTCCTGGTT GGGCAGGAGG CTGTGCGGGA GGCCCTGGGA 

2101 GGTCAGGCTG AGGAGTTCAG CGGCCGGGGA ACCGTAGCGA TGCTGGAAGG 

2151 GACTTTTGAT GGCCATGGTA AGTCAAGGGC TGCTAGGCCC TCCGCTCACA 

2201 GCCTGCCACC ACTTACTGGT GTGTGACCTT TGCACATGGC TTAGTCCCTC 

2251 TGTTGCCTCA TCTGTCAAAT GGAGTGATAA CAGTGCCCAT CAGCCGGGTG 

2301 CAGTGGCTAG TGCCTGAAAT CCCAACACTT TGGGAGGCGG AGGTGGGTGG 

2351 ATCACTTGAG GTCAGGAGTT CGAGACCAGC CTGGCCAACA TGGTGAAACC 

2401 CTGTCTCTAC TAAAAATATA AAAATTAGCT GGGCATGGTG GTGCGTACCT 

2451 GTAATCCCAG ATACTTGGGA GGTTGAGGCA GGAGAATCGC TTGAACCCGG 

2501 GAGGCAGATG TTGCAGTGAA CCAAGACTGT GCCACTGCAC TCCAGTCTGG 

2551 GCAACAGAGT GAGCCTCCAT CTCAAACAAA CAAACAAAAA GCAGTGCCCA 

2601 TCATGTAGGA TTGAGTGATT GAGTGAGGAC TGAGCCTTGT GCAAAGTGAG 

2651 CACTCACTAA TCACCAGGTT GTAGTATCAG TGATAACCAT CAATGATCCA 

2701 GGTAAAGCCC TGAGGGTTCA GAAAGATGCC GGAGCGCTTT CAAGGTGCTG 

2751 GGGATTGGTG GGCAAGCCCT CGAATAATAG AAACAGTTCT CTGTATTACA 

2801 ACAGAAAGCA GGAGGCCCAT GCTGGGTGCT GCCAGGAACT CAGTAGTAAC 

2851 TAAGACAGCA CCGGTGCTGC TTCCCCAGCG CACCTAGGCC AGTGGGGAAA 

2901 CAGACTCACC ACACAGTCCC AGCCCAGAGT GGTCAGGGCC AAGATGGGGA 

2951 AGCACGGGGA GAAAGGTCAG GGTGGGATGG GGAGGGGTCA GGGCAAGAGG 

3001 GGTCAGGGCC AGGCTGAGGG AAGCCCTGGG ACTGTAGGAA TTTAGAGGAG 

3051 GTACC TGACC CGGCATGTTT GGTGAGGGAG ATTCAGGAAG TCTTCCTGGA 

3101 AGAGAGGCTG TCGGAGCTGA GACTCATAAG ATGAGTGGGG AGGGTGTTCC 

3151 AGGCAGAAAG ACCAGCACCT ACAAAAGCAT GACTTTGAGA GAAGCATTCA 

3201 TCCATTCAAC TGATGAATTT TCAGACTGGG CACGCTGGCT CATGCCTGTA 

3251 ATCCCAGCAC TTTGGAAGGC TGAATGGGGA GGATGACTTG AGCCTAGGCA 

3301 TTTGTGACAA GCCTGGGCAA CATGGTGAGA CCCTGCCTCC ACAAAACAAA 

3351 CAAACAAACA AAAAATCATT ATACCTGGTA CCATGGGTAC CAGGTACATA 

3401 GAAATGACTC AGGCAGATAT GGTGTCCTCT CCTACTGTGG GAGAGGCGGG 

3451 CTTATACTGC AGTAAGACAA TAGAGGGAGG GAATATAATC CTAAAATGAG 

3501 AGGTACAGAT TTGAGAGCAA ACACAGGGCA CAGGCATATG TACGAGGGTA 
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3551 AAGAGGGAAT CAGGGAAGGC TTCTCAGAGA AGGTGACATT TAAGCCGGGA 

3601 CATGAAGGAT GAACGAGTTA GTTCACCAAG GATGGGATGG AAAGGGGTGA 

3651 GAGTGATGGA GGCAGAGGGA ACTGCAGGAT CATAGGCCTA GACAGGGGAT 

3701 CCTGACGCCC TTGAGGAAGT GAGAGAAGAC CAGCGCAGTC GTAGTGGGTT 

3751 AAGTAACAAA GCTGAGAAGC CAGGGAAATC CCTGGTCATG CAGGGCCTGT 

3801 GAGTCACGTC AGAGTGTTTG GGCTTTTGTT TTTCCTGGGA GCAGTCGATT 

3851 TTAAGCAGGG AACAGCTGTA TTCAGAGTTG GGAAGATCCT GTGGTTGCTG 

3901 CCTGAAGGGG ATGAAACTGG AGGCTAGGAG CCCAGGGTGA TAGGGAGGAT 

3951 CCAGGGTGAT GGGGAGGCTG GGAGGTCCGC GGTGATGGAC CAGGGCTGGG 

4001 GCCAGGGGAT GGGGAGGAAG GAGTAATTGG GAGAGGCCTG GGGCTCTGGC 

4051 CGAGGAATGG ATGGTGGGCT GAAACAGGGA GAGGAGAGAT GCTTAGGCCA 

4101 CTTTGGAACA CAGTAGGGCA AGGACAGGAG ACACCCAAGG GGAAGTGCCC 

4151 AAGAGACCAC GACAGGCTGG CATTGGACAG GGAAGGTCTG TCTGGAGCAG 

4201 GTGTCTTGGA TAAGGGAGGA AAATGGTGCA GTTCCATCCT CCTCCCTCTC 

4251 TGTTCAACCT CTAAACTACA TGGGGCACAG GACCCAGTGG GACTCCATAA 

4301 AT GAT GG GAT GGGTGGATGG AAGGAAGGAA GGAGGAAACA ACTCTTCATT 

4351 CATCCTGGTT ATTTACAGAA CAGGCCAGGT GCGGTGCTCA CGCTTGCCAT 

4401 TCTAGCACTT TGGGAGGCTG AGGTGGGTGG ATTACCTCAG GTCAGGAGTT 

4451 CAAGACCAGC CTAGACAACG TAGAGAAACC CCATCTCTAC TGAAGATATA 

4501 AAATTAGCTG GGCGTAGTGG CATATGCCTG TAATCCCAGC TAGTCGGGAA 

4551 GCTGAGGCAG GAGAATCGCT TGAACCCGAG AGGCAGAGGT TGCGGTGAGC 

4601 TGAGATCGTG CCATTGCACT CCAGCCTGGG TGACAAAGCA AGACCTCGTC 

4 651 TCAATAATAA TAATAATTAC AAAACAGAAG GAGCCTGGGT CATCCCAGCT 

4701 ACCTACTTTT CAGGAGAATG TACTCCCTTA CCCAAGGGCA AAGGATGGGA 

4751 GAACCAGTTT GATTATGCAT TTATTGAGCA CCTACTGAGT CCTCATCCCT 

4801 GGGCTAGGCT GGAATGGACT CAGATGGAGC CTGAAGAGTC CCCCTCAGGG 

4851 AACCTCACTA GAAAGAAGGA GGAATCGGCC GGGCGCGGTG GCTCACGCCT 

4901 GTAATCCCAA CACTTTGGGA GGCTGAGGTG GGTGGATCAC AAGGTCAGGA 

4951 GATCGAGACC ATCCTGGCTA ACACAGTGAA ACCCCATCTC TACTAAAAAT 

5001 ACAAAAAATG AGCCAGGCAT GGTGGCGGGC GCCTGTAGTC CCAGCTACTC 

5051 AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCA GAGGTTGCAG 

5101 T GAG AC GAGA TCACGCCACT GCACTCCAGC CTGGGCAACA GAGCGAGATT 

5151 CCGTCTCAAA AAAAAAAAGA AAGAAAGGAA GAAGGGGGAA TGGGGGAGAG 

5201 GGGCCGGTCC CTTTTTGAGT CTAGCCTTCT GCGCAGGGGT TTTCTTCTCC 

5251 AACGGGGAGC GGTGGAGGCA GCTGAGGAAG TTTACCATGC TTGCTCTGCG 

5301 GGACCTGGGC ATGGGGAAGC GAGAAGGCGA GGAGCTGATC CAGGCGGAGG 

5351 CCCGGTGTCT GGTGGAGACA TTCCAGGGGA CAGAAGGTCA GCATGGCGGG 

5401 GTCACCCCAG GGTCTCCAGC CGAGTGAAAG GGAAAACTCT CCTACTGTGG 

5451 CTGGGGGTGG CCCCAACCCA GGTCCTGGAA TGGGCAGGAG GGGAAGCCTT 

5501 GAACTCTAGG GCTGGCCTGG GGGTTCTGTT CACTGCCACC TTCTGTCTCT 

5551 GTCCCACTGT CTCTCCGAGG CTGTCATGAC ATCTCTCTGT GTGTCTCTGG 

5601 TGCTATCATC CCATTCTTCC TGGGTCTCCA TCTCTCTCTC TGTCTCTTTT 

5651 CTTTCTCTCT CCTTTCCTCT ATTTTTTGGG CCCTCAGTCT ATCTCTGTTT 

5701 CTGTCTCCCT GTCTGTGTGA TGGTCACTCT GTTTCTTTCT CCCTGTCTGT 

5751 TTCTCTGTCC CTATCTGTCT GTATCCTTCT TTGCCTGTTT AGCTCTCTCC 

5801 CTGCGCTGTC CATCCATCTT TCCCTGCCTC CCTGTCTCTC TCTGGTTGGG 

5851 TTCAGCCCCA ACCTGCTCCC CTCTGCCTGG CTCCATCACA GCCTACCTCC 

5901 CTGCCCCCAT TCCCCCCAGG ACGCCCATTC GATCCCTCCC TGCTGCTGGC 

5951 CCAGGCCACC TCCAACGTAG TCTGCTCCCT CCTCTTTGGC CTCCGCTTCT 

6001 CCTATGAGGA TAAGGAGTTC CAGGCCGTGG TCCGGGCAGC TGGTGGTACC 

6051 CTGCTGGGAG TCAGCTCCCA GGGGGGTCAG GTGAGTGGGT GGGACCCCTC 

6101 TCCAACTACC TTCCCTGAAG GTTCCTGCCA AGGTCCCATG AGAACTAGCT 

6151 GCCCTTCTCC CCACAGACCT ACGAGATGTT CTCCTGGTTC CTGCGGCCCC 

6201 TGCCAGGCCC CCACAAGCAG CTCCTCCACC ACGTCAGCAC CTTGGCTGCC 

6251 TTCACAGTCC GGCAGGTGCA GCAGCACCAG GGGAACCTGG ATGCTTCGGG 

6301 CCCCGCACGT GACCTTGTCG ATGCCTTCCT GCTGAAGATG GCACAGGTGT 

6351 GGGAAGGGTG CAGGGACCCC CTCTCTGAAT GGGCGTGGTG ACCTGGCAGG 

6401 TCCCAGCCAG GTGTCCCTGG GGACCTCAAT TGGGTTCCTC TCTCTTTCTC 

6451 TCTCTGCATG TCTCTGTGAG TATGAGTGTC TCTGTGCATG TGTGTGCATC 

6501 CCTTCTCTGC ACATCTGTGC TGGCCCTTTC AGGGCGTTGC TCTCACTGCC 

6551 TCTCCCGCCC CCGACCTGGG CATTTGTGCC GGGCTGTCTG TCTCTCCAGC 

6601 ATCTCTCCTC TTTCTCCCTC CCACCTCGGC CCTTGTGTTC AGGCCCCATG 

6651 CCCAGGGTCC TACACCAGCA ATCCCCAGGA TCACTTCATC CCATCCCCTG 

6701 CAGCCTCCCC AGACTTTTAT GTAAATTCAC AATTTTATGT GAATTATGGT 

6751 CAT T TAT TAG GAAGCCTTGC AATATCAAGT TATGTTAATA AAGTCCACTT 

6801 TATTAATTAT ATAAGAACAA TATTTCTTTT CCTTTTTTTT TTCTTTTCTT 

6851 TTTAAAGAGA CAGGATCTCT TTCTGTTGCC CAGGCTAGAG TACAGTTGCA 

6901 AAATCATAGC TCACTGCAAC CTTGAACTCC TGGGCTCAAG CAATCCTCCT 

6951 GCCTCGGGCT CCTGAGTAGC TGGGACAACA GGTGTGCACC ACCACACCTG 

7001 GCTAAATTTT TTTTTTTTCT TTGTAGAGAT AGACTCTCAC TATGTTACCC 

7051 AGGCTGGTCT TGAATTCCTG GGCTCATGTA ATCCTCCTGC TGCCTTGAAC 
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7101 TCCCAAAGTG CTGGGACTAT AGGCATAAGA CATCATGCCC GGTCGGGCAC 

7151 AGTGGCTCAT GCCTGTAATC TCAGGACTTT GGGAGGCCGA GACGGGCGGA 

7201 TCACCTGAGG TCGGGAGTTC GAGACCAGCC TGACCAACAT GGAGAAACCC 

7251 CATCTCTACT AAAAAAAAAA ATACAAAATT AGCCGGACGT GGTGGCACAT 

7301 GCCTGTAATC CCAGCTACTA GGGAGGCTGA GGCAGGAGAA TCGCTTGAAC 

7351 CCGGGAGGCT TAGGTTGCGG T GAG CT GAGA TTGCACCATT GCACTCCAGC 

7401 CTGGGCAACA AGAGCGAAAT TCCATCTCAA AAAAAAAAAA AAAGAAAAAA 

7451 AGAAAAAAGA CACCATGCCC TATAAGTAAA CTAGAATTAA GGTGACTCCT 

7501 AAGGAAATAA ATAGTTTTTA ACTGTACGAA CTTTTGGAAG AATGGGGCCA 

7551 ATTCTTTAAT TAAATGCAGC CTCCCTGTTT GTGGAGAAAG AAAAATTTTT 

7601 CTTAACCCTA TTGCCCCATT TCTTTTCTCT TTTATTGAAT ATTTTTTAGT 

7 651 TTTAACTATA GTAAAATACA CATAACGTTT ACCATCTTAA CCATTTTTAG 

7701 GTATACAGTA CAGTAGTGTT CAGTACATTC ATACTGTTAT GCAATCAGTC 

7751 TCCAGAACTC TTCATGTTGC AAAGCTGAAA CTCTATACCC ATTAAACAAC 

7801 TGCCTGTTCC TCCCTCCTCC AACCCCTGGC AATCACCTTT TTTTTTTTGA 

7851 GACGAAGTCT CACTCTGTCA CCCAGGCTAG AGTGCGGTGG CTCGATCTCG 

7901 GCTCACTGCA AGCTCCGCCT CCCGGGTTCA TGCCATTCTC CTGCCTCAGT 

7951 CTCCCAAGCA GCTGGGACTA CAGGTGCCCG TCACCACGCC TGGCTAATTT 

8001 TTTGTATTTT TAGTAGAGAT GGAGTTTCAT CGTGTTAGCC AGGCTGATCT 

8051 CAAACTCCTG GCCTCAAGTG ATCCACCCGC CTCGGCCTCC CAAAGTGCTG 

8101 GGACTACAGG CGTGAGCCAC TGTGCCTGGC CAGGAAGTAG ACTCTTGATA 

8151 TTAGTTCTCT CTGGTTGAAA TGTTTTTAAA AATGAAAGAG AATGACTAAT 

8201 AACAAAAACA CAGAAAGTTA TAAGGATTGA TGAAGATGTG GAGACTTTGA 

8251 AACCCATGTA TACCATTGGT GGGAATGTGA AACGACGCAG CCCTGTGGAA 

8301 AATGGTACAG CAGTTACCTG AGGTCAGGAG TTTGAAACCA ACCTGGCCAA 

8351 CATGCAGAAA CCCCGTCTCC ATTAAATGTA CAAAAATTAG CCAGGNNNNN 

8401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

8951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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10651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

12601 NNNNNNNNNN NNNNNNNNNN NNNNNNCTCT TTCTTCCTTT CTTCCTTCCT 

12651 CCCTTCCTCC CTTCCTCCCT TCCTGTCTTC CTCTCTTTCT CTCTTTCTTT 

12701 CTTGACAGGG TCTCTCTTTG TCTCCCAGGC TGGGGTGCAG TGGTACAAGC 

12751 ATAGCTCACA GCAGCCTTGA ACTCCTAGGC TCAAGTGATC CTCCCACGTC 

12801 AGCCTCCTGA GCAGCTGGGA CAACGGGCTC ATACCACCAT GCCTGGCTAA 

12851 TTTTTTAATT TTTCGTAGAG ACAAGGTCTT GTTATATTGC CCAGGCTGGT 

12901 CTCAAACTCC TGGGCTCAAA TGCTTCTCTC ACCTCAGCCT CCCACGTGGC 

12951 TGGGATTACA GGCATGAGCC ACTGCACGCC ACTCAACACT CCACAAATGT 

13001 TGATGCCATT ATGTTTTGTG AACTAGTGTC CCTGGCACCC GAGACTTGTA 

13051 CTCCACACTC GAGGACCAAA TAGACTGGGG TGGGAAGGGG TTTATAGTTT 

13101 CAT TAT T ATT TCCCCTCAGG GCACGGAGGT CTTCCCCCTC CTTGGCTCCA 

13151 TCCTGCATGA CCCCAACATC TTCAAGCACC CAGAAGAGTT CAACCCAGAC 

13201 CGTTTCCTGG ATGCAGATGG ACGGTTCAGG AAGCATGAGG CGTTCCTGCC 

13251 CTTCTCCTTA GGTATCTGCT GCAGCCCTGG GTATCACAAG CAGGTGCTGG 

13301 CGAACTCCAG GCATCTGTGC CAGCTGGGGG CACCCTTCTG CACCCTGGGC 

13351 TTACTGTTGG CTCCTCCACC TGCTGTTCCC CCCGTGGGCC TGGGTGTGAG 

13401 GAATACTGAC TCAGCCCTCT CTCTCTCTCT CTCCTCACCA GGGAAGCGTG 

13451 TCTGCCTTGG AGAGGGCCTG GCAAAAGCGG AGCTCTTCCT CTTCTTCACC 

13501 ACCATCCTAC AAGCCTTCTC CCTGGAGAGC CCGTGCCCGC CGGTACACCC 

13551 TGAGCCTCAA GCCCACCGTC AGTGGCCTTT TCAACATTCC CCCAGCCTTC 

13601 CAGCTGCAAG TCCGTCCCAC TGACCTTCAC TCCACCACGC AGACCAGATG 

13651 AAGGAAGGCA ACTTGGAAGT GGTGGGTGCC CAGGACGGTG CCTCCAGCCT 

13701 CAACAGTGGG CATGGACAGG GTTAATGTCT CCAGAGTGTA CACTGCAGGC 

13751 AGCCACATTT ACACGCCTGC AGTTGTTTTC CGGAGTCTGT CCCACGGCCC 

13801 ACACGCTCAC TTGACTCATG CTGCTAAGAT GCACAACCGC ACACCCATAC 

13851 ACAACTACAA GGGCCACAAA GCAACTGCTG GGTTAGCTTT CCACAGACAT 

13901 AAATATAGTC CATCTGCAAT CACAAGCACA TAGCCAGGTA ACCCACCAAC 

13951 TCCCCTGGAT CTGCAGCCCA CACGTGGGAG TCTGGCTGTC ACCTTCACAA 

14001 GCCACAGAAA CGGCCACACA TGTTCACAGC TCACACGCCC TCTCCATTCA 

14051 TCGAACTTCT CAGTGTCCCT GTCCCTGGTG CCTGGCACAG GGAACAGCAT 

14101 GCCCCCTCCG GGGTCATGCC ACCCAGAGAC TGTCGCTGTC TATGGCCCCA 

14151 ACTCATGCTC CCTCTCTTGG CTACACCACT CTCCCAGCCT GTGACCACCG 
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14201 ATGTCCACAC ACCCCCAACC ACTTGTCCAC ACAGCTACCC ACGTACGACA 

14251 TCGTCCTGGC TCCCCAGAGT ATCTTCCCAC TGAGACACGC CGCCCCCACA 

14301 GAGGCACAGT CCCCAGCCAC CTCTGCAACT GCAGCCCTCA GTCACCCCTT 

14351 TTTAAGCACC CTGATTCTAC CAAATGCAAA CACATCT GGG TCTGCGATTA 

14401 TGCACAGAGA CTTTGGACAT ACGAGGACCC T C AG AC C GG A GGAACACCTG 

14451 CCCAACCCCA ACACGTGCTT ATGTAACCAC GTGGAAAGCG GCCCCTGCTG 

14501 CCCCTCCACA CACACATACA CACTCACTGA TCTACAGCCC CTGTTCGGCG 

14551 TCAGAGTCCC CACTAGACCC AGTGGAAGGG GTTAGAGACC AAGTAGGGGC 

14601 CAGTTTCCAA TTCACCCTGT CAGGGAGTGA GCCGGATCTG ACGTTCCTTG 

14651 TGACTTAAGG GTCCGGCTTG GGAATTAAAG TTTGTTTCTG GCCTTTAGCC 

14701 TACTGCGTGT GTGACCCGTG TCAGTCACTG TGAGTAAGGG GTGGGGACAG 

14751 GGGAGTCCAC CCCTCCCCTG AGGCTGGGCG GGAGCTGAAA AACATGGCCA 

14801 CCGCCCACCC TGGCTGTTGA CATCAGGACC AGATGTGGAG CTGGGAGGAG 

14851 GGGC AGGGCT GGTGACGCCC TGGGCCTCAT TTCCAAAAAG GGCCAAGGTG 

14901 TCCGGCGGTG GGAAGTGGGC AAGGAGGGGG TAACCCAAGC TGGACTGTGG 

14951 ACCTTGGGGG CTTCCTCAGC CAGGGAGAGC CTGAAGCCAA CTAGATCCAG 

15001 ACCCTAGAGA CTCTTCAAAC TTGAGTACAG GAACTAGCTT GCAACACAGA 

15051 CTCTAAGCCC ACTCCCATTT CTTCCACCCT TTTTCTCTTG CCTCCCCTTC 

15101 ACAAGGAAAC CAGAGGCATT TGTAAATTTC CTTTCTTTTT TTTTTTTTTT 

15151 TTTTTTTTGA GACGGAGTCT CACTCTGTCA CCCAGGCTGG AGTGCAGTGG 

15201 TGTGATCTTG GCTCACTGCA GCCTCCGCCT CCGGGTTCAA GCCATTCTCC 

15251 TGCCTCAGCC TCCCAAGTAG CTGGGATTAC AGGTGTGTGC CACCACGCCC 

15301 AGCTAATTTT TGTATTTTTA GTAGAGATGG GGTTTCACCA TGTTGGCCAG 

15351 GCTGGTCTCG AACTCCTGAC CTCAGATGAT CTGCCAGTCT CGGCCTCCCA 

15401 AAATGCTGGG ATTACAGGCG TGAGTCGCTA CTAGATAAAT TTCTTATCTA 

15451 GCAAAGAAGT TTGCAAACAT ACGCAAAAGT AGAAAGATAC AATGAGCCCC 

15501 CAGGTGCCCA TCACCCAGCC TCATTTCAAT AGTCATCAAC TTTCTGCAGC 

15551 TTTTACTTCA TCTATATCCT TTTCTGCCTC TTTTTTTTTT TTTTTATTTT 

15601 GAGATAGGGT TTTGCTTTGT TGCCCAAGCT GGGGTGCAGT AGCATGATCT 

15651 CATAGTTCAC TGGGGCTTCA GACTCCTAGG CTCAAGTGAT CCTCCCGCCT 

15701 CGGCCTCCAA GCAGCTGGGA CTACAGATGC GTGCCACCAC ACCCAGCTAA 

15751 ATTTCTTATT TTTATTTTCT ATAGAGAAAG TCTCACTATA CAGCCCTGTG 

15801 CTGGTCTCAA ATTCCAGGCC TCAAGAGTTT CCATCCCAGC CTCCCAAAGT 

15851 GCTGGGATTA TAGGCGTGAG TCACTGCACC CTGCCCTAAT ATTTTTATTT 

15901 TATCTATTGC TTTTTATTTA CTTATTTATC TTTTATTTTT GAGACAGAGT 

15951 CTCACTCTGT GGCCCATGCT GGAGTGCAGT GGCATCATCT CGGCTCACTG 

16001 TAACCTCCGC CTCTTAGGTT CAAGCAGTTC TCCTGCCTTG ACCTCCCGAG 

16051 TAGCTGGAAT TACAGGTGCC TGCCACCAAG CCTGGCTAAT TTTTTTATTT 

16101 GTAGTAGAGA TGGGGTTTTG CCATGTTGAC CAGGCTGGTC TCGAACTCCT 

16151 GACCTCAGGT GATCTGCCCA CCTTGGCCTC CCAAAGTGCT G AG AT TACT G 

16201 GTATGAGCCA CCGTGCCTGG CCACCTATTG CTTTTTAAAG ATTATTTTTT 

16251 T ATT ATT ATT ATTTTTTTAT TTGCAGATGG AGTTTCGCTA TTGTTGCCCA 

16301 GGCTGGAGTG CAATGGCGTG ATCTCAGCTC ACCGCAACCT CCGCCTCCCA 

16351 GGTTCAAGCG ATTCTCCTGC CTCAGCATCC CTAGTAGCTG GGATTACAGG 

16401 CATGCACCAC CATGTCCAGC TAATTTTGTA TTTTTAGTAG AGACGAGGTT 

16451 TCTCCAGGTT GGTCAGGCTG GTCTCAAACT CCCAACCTCA GGTGATCCGC 

16501 CCATCTCGGC CTCCCAAAGT GCTGGGATTA CAGGTGTGAG CCACCGCGCC 

16551 TGGCCTTAAA GATTATTTTA AGGCAAATTA CAGAAAGCAA TTTAATGCAC 

16601 ATT TCT GAGA GTTAAAGATA TTTTTGCCCT TGACATTTTA TGAGGACAGT 

16651 TTTCAAACAT GCAGCAAAGT TGAGGGAATT GTACAAGGAA CACCTTGTGC 

16701 ACTCCTGCCT CAGTCTCCCA AGCAGCTGGG ACTACAGGTG CCCGTCACCA 

16751 CGCCTGGCTA ATTTTTTGTA TTTTTAGTAG AGATGGAGTT TCATCGTGTT 

16801 AGCCAGGCTG ATCTCAAACT CCTGGCCTCA AGTGATCCAC CCGCCTCGGC 

16851 CTCCCAAGTG CTGGGACTAC AGGCGTGAGC CACTGTGCCT GGCCAGGTAA 

16901 GTAGACTCTT GATATTAGTT CTCTCTGGTT GAAATGTTTT TAAAAATGAA 

16951 AGAGAATGAC TAATAACAAA AACACAGAAA GTTATAAGGA TTGATGAAGA 

17001 TGTGGAGACT TTGAAACCCA TGTATACCAT TGGTGGGAAT GTGAAACGAC 

17051 GCAGCCCTGT GGAAAATGGT ACAGCAGTTA CCTGAGGTCA GGAGTTTGAA 

17101 ACCAACCTGG CCAACATGCA GAAACCCCGT CTCCATTAAA TGTACAAAAA 

17151 TTAGCCAGGC ATGGTGGTGC GCACCTGTAA TCCCAGCTAC TCGGGAGGCT 

17201 GAGGCAGGAG AATTGCTTGA ACCCAGGAGG CGGAGGTTGC AGTGAGCCGA 

17251 GATCGTGCCA CTGCACTCAG CCTGGGCAAC AAAGCAAGAC TCTGTCTCAA 

17301 AAAAAAAAAG TCTACTTCCC AACCTTCCCA AAAATTTATC TAAACCCCGT 

17351 GACAAAACTT TAACTTGTGT TTCCGACCCC AGGCTTGGCT GTTCTGGACA 

17401 TTTACTTCCC AAAGGCTGTG TTCTCTCAGC CCCTCTGCCT GGTTTCTTTC 

17 451 AGGAGGAACA AAACCCAGGC ACAGAATTCA CCAACAAGAA CATGCTGATG 

17501 ACAGTCATTT ATTTGCTGTT TGCTGGGACG ATGACGGTCA GCACCACGGT 

17 551 CGGCTATACC CTCCTGCTCC TGATGAAATA CCCTCATGTC CAAAGTAAGA 

17 601 GCCTTTTCCA CTTGCCAGGC CTTGGGAACA GAAGTCAGGG TTCTAGGCTG 

17 651 AGCAAGGTGG CTCACGCCTA TAATCCCAGC ACTTTGGGAG GCTGAGGCGG 

17701 GCTGATCACT TGAGAATAGG AGTTTAAGAC CAGCCGGCCA ACACAGTGAA 
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4D 



17751 AC 

FEATURES: 

Start: 

Exon: 

Intron: 

Exon : 

Intron: 

Exon: 

Intron: 

Exon: 

Intron: 

Exon: 

Intron: 

Exon: 

Intron: 

Exon: 

Intron: 

Exon: 

Stop: 

SNPs: 



DNA 

Position Maj 



2226 


G 


2226 


C 


3081 


A 


3788 


A 


3979 


T 


5056 


T 


5213 


T 


5508 


A 


5857 


C 


6385 


C 


6813 


C 


7853 


C 


12973 


G 


12973 


A 


13012 


G 


13072 


A 


13370 


C 


13682 


C 


14631 


C 



1999 6349 

1999 2167 

2168 5236 

5237 5386 

5387 5919 

5920 6080 

6081 6166 

6167 6349 

6354 13542 

6354 6636 

6637 13087 

13088 13261 

13262 13441 

13442 13542 

13547 13648 

13547 13648 
13649 



Minor 

C A 
A G T 
T G C 
T 

C G 
C G 
G A C 
C T G 
GAT 
T G 
T A 
T 

C T 
G T 
ACT 
T C G 
G 

T A 
T G 



Context : 

DNA 

Position 
2226 



2226 



TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 
TCCTCCCTACCCCCAGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTACCTGGGACC 
CTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGCCCTGGGAGGTCA 
GGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTTGATGGCCA 
TGGTAAGTCAAGGGCTGCTAGGCCCTCCGCTCACAGCCTGCCACCACTTACTGGTGTGTG 

CCT TT GC AC AT GGCT T AGTCC CT C T GT T GC CTC AT C T GT C AAAT GG AGT G AT AAC AGT G C 
CCATCAGCCGGGTGCAGTGGCTAGTGCCTGAAATCCCAACACTTTGGGAGGCGGAGGTGG 
GTGGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 
C T AC T AAAAAT AT AAAAATT AGCT GGGC AT GGT GGT GC GT AC CT GT AAT CC C AG AT AC T T 
GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGATGTTGCAGTGAACCAAGA 

TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 
TCCTCCCTACCCCCAGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTACCTGGGACC 
CTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGCCCTGGGAGGTCA 
GGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTTGATGGCCA 
TGGTAAGTCAAGGGCTGCTAGGCCCTCCGCTCACAGCCTGCCACCACTTACTGGTGTGTG 

[C,A,GsT] 
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CCTTTGCACATGGCTTAGTCCCTCTGTTGCCTCATCTGTCAAATGGAGTGATAACAGTGC 
C CAT C AGCCG GGT GC AGT G GCT AGT GC CT G AAAT C C C AAC ACT T TGGGAG GC GGAGGT GG 
GT GG AT C ACT T G AGGTC AGG AGTT C G AGAC C AGC CT GGC C AAC ATGGT GAAAC C C T GTC T 
CTACTAAAAATATAAAAATTAGCTGGGCATGGTGGTGCGTACCTGTAATCCCAGATACTT 
GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGATGTTGCAGTGAACCAAGA 

3081 AAAC AGT T CT CT G T AT T AC AAC AGAAAGC AGG AGGC C C A TGC T GGGTGC T GCC AGGAACT 

CAGTAGTAACTAAGACAGCACCGGTGCTGCTTCCCCAGCGCACCTAGGCCAGTGGGGAAA 
CAGACTCACCACACAGTCCCAGCCCAGAGTGGTCAGGGCCAAGATGGGGAAGCACGGGGA 
GAAAGGTCAGGGTGGGATGGGGAGGGGTCAGGGCAAGAGGGGTCAGGGCCAGGCTGAGGG 
AAGCCCTGGGACTGTAGGAATTTAGAGGAGGTACCTGACCCGGCATGTTTGGTGAGGGAG 
[A,T,GSCJ 

TTCAGGAAGTCTTCCTGGAAGAGAGGCTGTCGGAGCTGAGACTCATAAGATGAGTGGGGA 
GGGTGTTCCAGGCAGAAAGACCAGCACCTACAAAAGCATGACTTTGAGAGAAGCATTCAT 
CCATTCAACTGATGAATTTTCAGACTGGGCACGCTGGCTCATGCCTGTAATCCCAGCACT 
TTGGAAGGCTGAATGGGGAGGATGACTTGAGCCTAGGCATTTGTGACAAGCCTGGGCAAC 
ATGGTGAGACCCTGCCTCCACAAAACAAACAAACAAACAAAAAATCATTATACCTGGTAC 

3788 ATCCT AAAATGAGAGGT ACAGATTTGAGAGCAAACAC AGGGCACAGGCAT ATGTACGAGG 

GTAAAGAGGGAATCAGGGAAGGCTTCTCAGAGAAGGTGACATTTAAGCCGGGACATGAAG 
GATGAACGAGTTAGTTCACCAAGGATGGGATGGAAAGGGGTGAGAGTGATGGAGGCAGAG 
GGAACTGCAGGATCATAGGCCTAGACAGGGGATCCTGACGCCCTTGAGGAAGTGAGAGAA 
GACCAGCGCAGTCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTC 
[A, T] 

TGCAGGGCCTGTGAGTCACGTCAGAGTGTTTGGGCTTTTGTTTTTCCTGGGAGCAGTCGA 
TTTTAAGCAGGGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGG 
GGATGAAACTGGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGC 
TGGGAGGTCCGCGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATT 
GGGAG AGGCC TGGGGCT C TGGC CGAGGAAT GGAT GGT GG GCT GAAAC AGGG AG AGG AG AG 

3979 ATCATAGGCCTAGACAGGGGATCCTGACGCCCTTGAGGAAGTGAGAGAAGACCAGCGCAG 
TCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTCATGCAGGGCCT 
GTGAGTCACGTCAGAGTGTTTGGGCTTTTGTTTTTCCTGGGAGCAGTCGATTTTAAGCAG 
GGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGGGGATGAAACT 
GGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGCTGGGAGGTCC 
[T,C,G] 

CGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATTGGGAGAGGCCT 
GGGGCTCTGGCCGAGGAATGGATGGTGGGCTGAAACAGGGAGAGGAGAGATGCTTAGGCC 
ACTTTGGAACACAGTAGGGCAAGGACAGGAGACACCCAAGGGGAAGTGCCCAAGAGACCA 
CGACAGGCTGGCATTGGACAGGGAAGGTCTGTCTGGAGCAGGTGTCTTGGATAAGGGAGG 
AAAATGGTGCAGTTCCATCCTCCTCCCTCTCTGTTCAACCTCTAAACTACATGGGGCACA 

5056 AGTTTGATTATGCATTTATTGAGCACCTACTGAGTCCTCATCCCTGGGCTAGGCTGGAAT 
GGACTCAGATGGAGCCTGAAGAGTCCCCCTCAGGGAACCTCACTAGAAAGAAGGAGGAAT 
CGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAACACTTTGGGAGGCTGAGGTGGGTGG 
ATCACAAGGTCAGGAGATCGAGACCATCCTGGCTAACACAGTGAAACCCCATCTCTACTA 
AAAATACAAAAAATGAGCC AGGC ATGGTGGC GGGC GC CT GT AGT CC C AG CT ACTC AGG AG 
[T,C,G3 

CTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGACGAGATCACGC 
C AC TGC AC TC CAGCC TGGGCAACAGA GCGAGATT C C GT C T C AAAAAAAAAAAG AAAGAAA 
GGAAGAAGGGGGAATGGGGGAGAGGGGCCGGTCCCTTTTTGAGTCTAGCCTTCTGCGCAG 
GGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCTGAGGAAGTTTACCATGCTTGCTC 
TGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGAGCTGATCCAGGCGGAGGCCCGGT 

5213 CTT T GGGAGGC T G AGGT GGGTGG AT C AC AAGGTC AGG AG AT CG AG ACC ATC CT GGCT AAC 

ACAGTGAAACCCCATCTCTACTAAAAATACAAAAAATGAGCCAGGCATGGTGGCGGGCGC 
CTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGA 
GGTTGCAGTGAGACGAGATCACGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGATTCC 
GTCTCAAAAAAAAAAAGAAAGAAAGGAAGAAGGGGGAATGGGGGAGAGGGGCCGGTCCCT 
[T, G f A t C] 

TTTGAGTCTAGCCTTCTGCGCAGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCT 
GAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGA 
GCTGATCCAGGCGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGGTCAGCA 
TGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCTCCTACTGTGGCTG 
GGGGTGGCCCCAACCCAGGTCCTGGAATGGGCAGGAGGGGAAGCCTTGAACTCTAGGGCT 

5508 TCC C T T TTTG AGTCT AGCC T TC TGC GC AGGGGTT TT C T T CT C C AACGG GG AGC GGT GG AG 

GCAGCTGAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGG 
CGAGGAGCTGATCCAGGCGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGG 
TCAGCATGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCTCCTACTG 
TGGCTGGGGGTGGCCCCAACCCAGGTCCTGGAATGGGCAGGAGGGGAAGCCTTGAACTCT 
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[A,C,T,G] 

GGGCTGGCCTGGGGGTTCTGTTCACTGCCACCTTCTGTCTCTGTCCCACTGTCTCTCCGA 
GGCTGTCATGACATCTCTCTGTGTGTCTCTGGTGCTATCATCCCATTCTTCCTGGGTCTC 
CATCTCTCTCTCTGTCTCTTTTCTTTCTCTCTCCTTTCCTCTATTTTTTGGGCCCTCAGT 
CTATCTCTGTTTCTGTCTCCCTGTCTGTGTGATGGTCACTCTGTTTCTTTCTCCCTGTCT 
GTTTCTCTGTCCCTATCTGTCTGTATCCTTCTTTGCCTGTTTAGCTCTCTCCCTGCGCTG 

5857 CTGTCTCTCCGAGGCTGTCATGACATCTCTCTGTGTGTCTCTGGTGCTATCATCCCATTC 
TTCCTGGGTCTCCATCTCTCTCTCTGTCTCTTTTCTTTCTCTCTCCTTTCCTCTATTTTT 
TGGGCCCTCAGTCTATCTCTGTTTCTGTCTCCCTGTCTGTGTGATGGTCACTCTGTTTCT 
TTCTCCCTGTCTGTTTCTCTGTCCCTATCTGTCTGTATCCTTCTTTGCCTGTTTAGCTCT 
CTCCCTGCGCTGTCCATCCATCTTTCCCTGCCTCCCTGTCTCTCTCTGGTTGGGTTCAGC 
[C,G,AsT] 

CCAACCTGCTCCCCTCTGCCTGGCTCCATCACAGCCTACCTCCCTGCCCCCATTCCCCCC 
AGGACGCCCATTCGATCCCTCCCTGCTGCTGGCCCAGGCCACCTCCAACGTAGTCTGCTC 
CCTCCTCTTTGGCCTCCGCTTCTCCTATGAGGATAAGGAGTTCCAGGCCGTGGTCCGGGC 
AGC TGGT GGT AC CCT GCTGGGAGT CAGCT CC C AGG GGGGT C AGGT G AGTGGGTGGGAC CC 
CTCTCCAACTACCTTCCCTGAAGGTTCCTGCCAAGGTCCCATGAGAACTAGCTGCCCTTC 

6385 GTGGGTGGGACCCCTCTCCAACTACCTTCCCTGAAGGTTCCTGCCAAGGTCCCATGAGAA 
CTAGCTGCCCTTCTCCCCACAGACCTACGAGATGTTCTCCTGGTTCCTGCGGCCCCTGCC 
AGGCCCCCACAAGCAGCTCCTCCACCACGTCAGCACCTTGGCTGCCTTCACAGTCCGGCA 
GGTGCAGCAGCACCAGGGGAACCTGGATGCTTCGGGCCCCGCACGTGACCTTGTCGATGC 
CTTCCTGCTGAAGATGGCACAGGTGTGGGAAGGGTGCAGGGACCCCCTCTCTGAATGGGC 
[C,T,G] 

TGGTGACCTGGCAGGTCCCAGCCAGGTGTCCCTGGGGACCTCAATTGGGTTCCTCTCTCT 
TTCTCTCTCTGCATGTCTCTGTGAGTATGAGTGTCTCTGTGCATGTGTGTGCATCCCTTC 
T CTGC AC AT CT GTGCT GGC CCT T T C AGGGCGTT GC T CT C AC T GCCT C T CC CGC CC CCGAC 
CTGGGCATTTGTGCCGGGCTGTCTGTCTCTCCAGCATCTCTCCTCTTTCTCCCTCCCACC 
TCGGCCCTTGTGTTCAGGCCCCATGCCCAGGGTCCTACACCAGCAATCCCCAGGATCACT 

6813 ATCTGTGCTGGCCCTTTCAGGGCGTTGCTCTCACTGCCTCTCCCGCCCCCGACCTGGGCA 
TTTGTGCCGGGCTGTCTGTCTCTCCAGCATCTCTCCTCTTTCTCCCTCCCACCTCGGCCC 
T T GT GT T C AGGCCCC AT GCCC AG GGT CCT AC AC C AGCAAT CC CC AGG ATC AC T T CAT C CC 
ATCCCCTGCAGCCTCCCCAGACTTTTATGTAAATTCACAATTTTATGTGAATTATGGTCA 
T T T AT T AGG AAGCCTT GC AAT AT C AAGTT AT GT T AAT AAAGT C CAC T T T ATT AAT TAT AT 
[C,T,A] 

AGAACAATATTTCTTTTCCTTTTTTTTTTCTTTTCTTTTTAAAGAGACAGGATCTCTTTC 
T GTT GC CC AGGC T AGAGT AC AGTT GC AAAAT C AT AGCT C AC T GC AAC C TTG AAC T CCT GG 
GCTCAAGCAATCCTCCTGCCTCGGGCTCCTGAGTAGCTGGGACAACAGGTGTGCACCACC 
ACACCTGGCTAAATTTTTTTTTTTTCTTTGTAGAGATAGACTCTCACTATGTTACCCAGG 
CTGGTCTTGAATTCCTGGGCTCATGTAATCCTCCTGCTGCCTTGAACTCCCAAAGTGCTG 

7853 TCTT T AATTAAATGC AG CCTCCCTGTTT GT GGAG AAAG AAAAATT T TTCT T AACCCT ATT 

GCCCCATTTCTTTTCTCTTTTATTGAATATTTTTTAGTTTTAACTATAGTAAAATACACA 
TAACGTTTACCATCTTAACCATTTTTAGGTATACAGTACAGTAGTGTTCAGTACATTCAT 
ACTGTTATGCAATCAGTCTCCAGAACTCTTCATGTTGCAAAGCTGAAACTCTATACCCAT 
TAAACAACTGCCTGTTCCTCCCTCCTCCAACCCCTGGCAATCACCTTTTTTTTTTTGAGA 
[C,T] 

GAAGTCTCACTCTGTCACCCAGGCTAGAGTGCGGTGGCTCGATCTCGGCTCACTGCAAGC 
TCCGCCTCCCGGGTTCATGCCATTCTCCTGCCTCAGTCTCCCAAGCAGCTGGGACTACAG 
GTGCCCGTCACCACGCCTGGCTAATTTTTTGTATTTTTAGTAGAGATGGAGTTTCATCGT 
GTTAGCCAGGCTGATCTCAAACTCCTGGCCTCAAGTGATCCACCCGCCTCGGCCTCCCAA 
AGTGCTGGGACTACAGGCGTGAGCCACTGTGCCTGGCCAGGAAGTAGACTCTTGATATTA 

12973 CTGTCTTCCTCTCTTTCTCTCTTTCTTTCTTGACAGGGTCTCTCTTTGTCTCCCAGGCTG 
GGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAACTCCTAGGCTCAAGTGATCCT 
CCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 
TTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCCCAGGCTGGTCTCAAACTCCTG 
GGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
[G,C,T] 

GCACGCCACTCAACACTCCACAAATGTTGATGCCATTATGTTTTGTGAACTAGTGTCCCT 
GGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCC 
TGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCTTCTCCTTAGGTATCTGCTGCA 
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12973 CTGTCTTCCTCTCTTTCTCTCTTTCTTTCTTGACAGGGTCTCTCTTTGTCTCCCAGGCTG 
GGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAACTCCTAGGCTCAAGTGATCCT 
CCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 
TTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCCCAGGCTGGTCTCAAACTCCTG 
GGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
[A,G,T] 

GCACGCCACTCAACACTCCACAAATGTTGATGCCATTATGTTTTGTGAACTAGTGTCCCT 
GGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCC 
T G C AT GAC C C C AAC AT CT T CAAGC ACC C AGAAGAGT T CAACC C AGACCGTT T CCT GGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCTTCTCCTTAGGTATCTGCTGCA 

13012 CTCTCTTTGTCTCCCAGGCTGGGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAA 
CT CC T AGG CT C AAGT G ATC CT C CC ACGT C AGCCT CCT GAGCAGCT GGGAC AACG GGC T C A 
TACCACCATGCCTGGCTAATTTTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
GGGATTACAGGCATGAGCCACTGCACGCCACTCAACACTCCACAAATGTTGATGCCATTA 
[G,A,CsT] 

GTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATA 
GACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCT 
TCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCC AGAC CGTTTCCTG GATGC A GAT GGACGGT T C AGGAAGCATGAGGCGT T CC T GC C CT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 

13072 CTCCTAGGCTCAAGTGATCCTCCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCA 
TACCACCATGCCTGGCTAATTTTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
GGG AT T AC AGGC AT G AGC C AC TGC ACGC C ACTC AAC ACT CCAC AAATGTTG ATGC CAT T A 
TGTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTTGTACTCCACACTCGAGGACCAAAT 
[A,T,CsG] 

G ACT GGG GTGGG AAGGGG TTTAT AGTT T C AT TATTAT TT CCC CTC AGGGCAC GG AG GT CT 
TCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 
ATCTGTGCCAGCTGGGGGCACCCTTCTGCACCCTGGGCTTACTGTTGGCTCCTCCACCTG 

13370 AT AGACTGGGGTGGGAAGGGGTTT AT AGTTTC ATTATTATTTCCCCTC AGGGCACGGAGG 

TCTTCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGT 
TCAACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGC 
CCTTCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCA 
GGCATCTGTGCCAGCTGGGGGCACCCTTCTGCACCCTGGGCTTACTGTTGGCTCCTCCAC 
[C,G] 

TGCTGTTCCCCCCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCT 
CTCCT C AC C AGGG AAGC GT GTCTGCCT T GG AG AGGGC CTGGCAAAAGCGGAGCTC T T C CT 
CTTCTTCACCACCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCC 
TGAGCCTCAAGCCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCAAG 
TCCGTCCCACTGACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGT 

13682 CCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCTCTCCTCACCAG 
GGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGAGCTCTTCCTCTTCTTCACCA 
CCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCCTGAGCCTCAAG 
CCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCAAGTCCGTCCCACT 
GACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGTGGTGGGTGCCC 
[C,T,A] 

GGACGGTGCCTCCAGCCTCAACAGTGGGCATGGACAGGGTTAATGTCTCCAGAGTGTACA 
CTGCAGGCAGCCACATTTACACGCCTGCAGTTGTTTTCCGGAGTCTGTCCCACGGCCCAC 
ACGCTCACTTGACTCATGCTGCTAAGATGCACAACCGCACACCCATACACAACTACAAGG 
GCCACAAAGCAACTGCTGGGTTAGCTTTCCACAGACATAAATATAGTCCATCTGCAATCA 
C AAGC AC ATAGCC AGGT AACCC ACC AACTC C CCT GGAT CT GCAGCC C AC AC GT GG G AGTC 
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14 631 GCAGCCCTCAGTCACCCCTTTTTAAGCACCCTGATTCTACCAAATGCAAACACATCTGGG 
TCTGCGATTATGCACAGAGACTTTGGACATACGAGGACCCTCAGACCGGAGGAACACCTG 
CCCAACCCCAACACGTGCTTATGTAACCACGTGGAAAGCGGCCCCTGCTGCCCCTCCACA 
C AC ACAT AC ACAC TC ACT GAT C T AC AGC CC CT GT T C GGC GT C AG AGT C CCC AC T AG AC CC 
AGTGGAAGGGGTTAGAGACCAAGTAGGGGCCAGTTTCCAATTCACCCTGTCAGGGAGTGA 
[C,T,G] 

CCGGATCTGACGTTCCTTGTGACTTAAGGGTCCGGCTTGGGAATTAAAGTTTGTTTCTGG 
CCTTTAGCCTACTGCGTGTGTGACCCGTGTCAGTCACTGTGAGTAAGGGGTGGGGACAGG 
GGAGTCCACCCCTCCCCTGAGGCTGGGCGGGAGCTGAAAAACATGGCCACCGCCCACCCT 
GGCTGTTGACATCAGGACCAGATGTGGAGCTGGGAGGAGGGGCAGGGCTGGTGACGCCCT 
GGGCCTCATTTCCAAAAAGGGCCAAGGTGTCCGGCGGTGGGAAGTGGGCAAGGAGGGGGT 

Chromosoiaal Map Position 
ePCR to dbSTS 

Site (bases) Marker Chr. Organism 

15155.. 15288 stSG46708 19 Homo sapiens 
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